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(57) Abstract 

The reverse two-hybrid method has been designed to provide a practical and efficient means of utilizing yeast cell-based assays to 
screen for molecules that can inhibit protein'i>rotein uiteractions of interest Existing two-hybrid systems involve recrastimtion in yeast of 
a transcriptional activator that drives expression of a "repoiter" geiie such as HIS3 or kicZ, Attempts to utilize these existing systems for 
drug discovery would necessarily involve screening for molecules that interfere with ttie transcriptional read-out, and would be subject to 
detecting any compound that non-specifically mterfered with transcription. In addition, since currently used reporter genes encode long-lived 
proteins, the assay would have to be perfomed over a lengthy time period to allow for decay of the preexisting rq>ortcr proteins. Any 
compound that would be toxic to yeast over this time period would also score as a "hit". The reverse two-hybrid interaction will avoid 
both of these pitfalls by drivmg the expression of a relay gene, such as the CAL80 gene, which encodes a protein that binds to and masks 
the activation domain of a transcriptional activator, such as Gal4. The reporter genes, which will provide the transcriptional read-out {HIS3 
or lasZ), are dependent upon functional Gal4 for expression. Only when the level of GalSO masking protein is reduced by interfering with 
the two-hybrid interaction will Gal4 function as a transactional activator, providing a positive transcriptional read-out for molecules that 
inhibit the two-hybrid protein-protein interaction. An important feature of the reverse two-hybrid system is that the basal level and half-life 
of the relay protein, GalSO. can be line-tuned to provide maximum sensitivity. 
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REVERSE TWO-HYBRID METHOD 

5 

TECHNICAL FIELD 

The invention relates to methods and compositions for identifying agents 
which modify intermolecular association between two or more polypeptides. Agents which 
10 specifically inhibit such protein*protein interactions are suitable for use as commercial 

reagents, pharmaceuticals, and for modulating gene expression in a cell culture and/or animal, 
such as to increase or decrease the expression of a predetermined protein in the cell culture or 
animal, and the like. 

15 BACKqROUW> 

Specific protein-protein interactions are fundamental to most cellular and 
organismal functions. Polypeptide interactions are involved in formation of functional 
transcription complexes, signal transduction patfaw^s, cytoskeletal organization (e.g., 
microtubule polymerization), polyp^tide hormone receptor-ligand binding, organization of 

20 multi-subimit enzyme complexes, and the like. 

Investigation of protein^rotein interactions under physiological conditions has 
been problematic. Considerable effort has been made to identify proteins that bind to proteins 
of interest. Typically, these interactions have been detected by using co-precipitation 
experiments in which an antibody to a known protein is mixed with a cell extract and used to 

25 precipitate the known protein and any proteins which are stably associated with it. This 

method has several disadvantages, such as: (1) it only detects proteins which are associated in 
cell extract conditions rather than under physiological, intracellular conditions, (2) it only 
detects proteins which bind to the known protein with sufficient strength and stability for 
efficient co-immunoprecipitation, and (3) it fails to detect associated proteins which are 

30 displaced from the known protein upon antibody binding. For these reasons and others, 
improved methods for identifying proteins which interact with a known protein have been 
developed. 

Two-Hvbrid Svstems 
One approach has been to use a so-called "two-hybrid" system to identify 
35 polypeptide sequences which bind to a predetermined polypeptide sequence present in a fusion 
proteui (Chien et al. (1991) Proc. Natl. Aca d. Sci. OJSA^ 9578). This approach identifies 
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protein-protein interactions in vivo through reconstitution of a transcriptional activator (Fields 
S and Song O (1989) Nature 245), the yeast Gal4 transcription protein. The method is 
based on the properties of the yeast Gal4 protein, which consists of separable domains 
responsible for DNA-binding and transcriptional activation. Polynucleotides encoding two 
5 hybrid proteins, one consisting of the yeast Gal4 DNA-binding domain fused to a polypeptide 
sequence of a known protein and the other consisting of the Gal4 activation domain fused to a 
polypeptide sequence of a second protein, are constructed and introduced into a yeast host 
cell. Intermolecular binding between the two fusion proteins reconstitutes the Gal4 DNA- 
bmding domain with the Gal4 activation domain, which leads to the transcriptional activation 
10 of a reporter gene (e.g., tocZ, HISS) which is operably linked to a Gal4 binding site. 
Typically, the two-hybrid method is used to identify novel polypeptide sequences which 
interact with a known protein (Silver SC and Hunt SW (1993) Mol. Biol. Rep. 17: 155; 
Durfee et al. (1993) Genes Devel. 2; 555; Yang et al. (1992) Science 257 : 680; Luban et al. 
(1993) £dl 71: 1067; Hardy et al. (1992) Genes Devel. ij; 801; Bartel et al. (1993) 
15 Biotechniques i±: 920; and Vojtek et al. (1993) £^ 74: 205). However, variations of the 
two-hybrid method have been used to identify mutations of a known protein that affect its 
binding to a second known protein (Li B and Fields S (1993) FASEB J. J: 957; Lalo ^ al. 
(1993) Proc. Natl. Acad. Sci. fUSA^ 2Q: 5524; Jackson et al. (1993) Mol. Cell. Biol. 12; 
2899; and Madura et al. (1993) J. Biol. Chem. 268 : 12046). Two-hybrid systems have also 
20 been used to identify interacting structural domains of two known proteins (Bardwell et al. 
(19931 med. Microbiol. 8: 1177; Clakraborty et al, (1992) J. Biol. Chem. 267 : 17498; 
Staudinger et al. (1993) J. Biol. Chem. 2fi8: 4608; and MUne GT and Weaver DT (1993) 
Genes D^v^|, 7; 1755) or domams responsible for oligomerization of a single protein 
awabuchi et al. (1993) Oncogene & 1693; Bogerd et al. (1993) J. Virol. £7: 5030). 
25 Variations of two-hybrid systems have been used to study the jn vivo activity of a proteolytic 
enzyme (Dasmahapatra et al. (1992) Proc. Natl. Ac ad, Sci. fUSA) S2: 4159). Alternatively, 
an E. coli/BCCP interactive screening system (Germino et al. (1993) Proc. Natl. Acad. Sgi. 
flLS^ 2Q: 933; Guarente L (1993) Proc. Natl. Aca d. Sci. fU.S.A.-> 2Q: 1639) can be used 
to identify interacting protein sequences (i.e., protein sequences which heterodimerize or form 
30 higher order heteromultimers). 

Each of these two-hybrid methods rely upon a positive association between 
two Gal4 fusion proteins thereby reconstituting a fiinrtional Gal4 transcriptional activator 
which then induces transcription of a reporter gene operably linked to a Gal4 binding site. 
Transcription of the reporter gene produces a positive readout, typically manifested either (1) 
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as an enzyme activity (e.g., iS-gaiactosidase) that can be identified by a colorimetric enzyme 
assay or (2) as enhanced cell growth on a defined medium (e.g., HISS). Thus, these methods 
are suited for identifying a positive interaction of polypeptide sequences, but are poorly suited 
for identifying agents or conditions which alter (e.g., inhibit) intermolecular association 

5 between two polypeptide sequences. In part, this is because a failure to obtain 

expression of the reporter gene can result from many events which do not stem from a 
specific inhibition of bmding of the two hybrid protems. For example, a two-hybrid system 
using a reporter gene that stimulates growth under defined conditions theoretically can be used 
to screen for agents diat inhibit the intermolecular association of die two hybrid protems, but 

0 it will be difficult or impossible to discriminate agents that specifically inhibit the association 
of the two hybrid proteins from agents which simply inhibit cell growth. Thus, an agent 
which is cytotoxic to yeast (e.g., bleach, phenol, ketoconazole, cycloheximide) will prevent 
cell growth without specifically inhibiting the interaction of two hybrid proteins and will score 
falsely as a positive hit. Similarly, a conventional two-hybrid system using a lacZ reporter 

S gene will falsely score general transcription or translation inhibitors (e.g., cycloheximide) as 
being inhibitors of two hybrid protein binding. Thus, two-hybrid systems that produce a 
positive readout contingent upon intermolecular binding of the two hybrid proteins are 
generally not suitable for screening iot agents which inhibit binding of the two hybrid 
proteins. 

0 Unfortunately, it would be desirable to have an efficient screening method for 

identifying compounds which specifically alter the intermolecular association between two 
known polypq)tide sequences under physiological conditions. Present two-hybrid methods 
rely on a positive readout and do not afford a m^od for identifying binding inhibitors (or 
binding competitors) with satisfactory sensitivity and/or selectivity. 

5 Thus, there is a need in the art for compositions and methods which can be 

used to efficientiy identify agents that specifically alter the intermolecular association between 
two polypeptide sequences in vivo . The present invention fulfills these and other needs. 

The references discussed herein are provided solely for their disclosure prior 
to the filing date of the present application. Nothing herein is to be construed as an admission 

} that the mventors are not entitied to antedate such disclosure by virtue of prior invention. All 
publications and patent applications herein are incorporated by reference to tiie same extent as 
if each individual publication or patent application was specifically and individually indicated 
to be incorporated by reference. 
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SUMMARY OF THE INVENTION 

The present invention provides several novel methods and compositions for 
identifying agents which alter intermolecular binding between two polypeptide species in a cell 
or in a cell-free transcription reaction. The invention relates to a general method, referred to 
5 herem as a reverse two-hybrid method, wherein agents which disrupt an int^moiecular 
association between two interacting polypeptides thereby generate a selectable and/or 
detectable readout (e.g., complementation of an auxotrophic phenotype, expression of a 
detectable reporter molecule, and the like). Typically, a reverse two4iybrid mediod produces 
a positive readout under conditions wh^ein an agent blocks or otherwise inhibits the 

10 intermolecular binding of the interacting polypeptides. A positive readout condition is 
generally identified as one or more of the following detectable conditions: (1) an increased 
transcription rate of a predetmnined reporteic gene, (2) an increased concentration or 
abundance of a polypeptide product encoded by a predetmnined reporter gene, typically such 
as an enzyme which can be readily assayed ia vivo , and/or (3) a selectable or otherwise 

IS identifiable phenotypic diange in an organism (e.g., yeast) harboring the reverse two-hybrid 
system. Graerally, a selectable or otherwise identifiable phenotypic change that characterizes 
a positive readout condition confers upon the organism eidier: a selective growth advantage on 
a defined medium, a mating phenotype, a characteristic morphology or developmental stage, 
drug resistance, or a d^ectable enzymatic activity (e.g., i3-galactosidase, luciferase, alkaline 

20 phosphatase, and the like). In this manner, it is possible to efficiently identify agoits 
(including but not limited to polypeptides, small molecules, and oligonucleotides) which 
inhibit intermolecular binding between two predetermined interacting polypeptides. 

In an aspea of the invention, a reverse two-hybrid system is composed of: (1) 
a first hybrid protein, (2) a second hybrid protein which binds to the first hybrid protein under 

25 control conditions (e.g., physiological conditions in the absence of agent), (3) a relay (or 

signal inverter) gene which is efficiently expressed as a consequence of the first hybrid protein 
and the second hybrid protein being functionally bound to each oth^, and (4) a rq)orter gene 
which is efficiently expressed when the product of the relay (or signal inverter) gene is 
substantially absent and is either poorly expressed or not expressed when the relay (or signal 

30 inverter) gene product is efficiently expressed. The first hybrid protein and second hybrid 
protein bind to each other through interacting polypeptide segments (i.e., a portion of the first 
hybrid protein preferentially binds to a portion of the second hybrid protein forming a 
heterodimer or higher order heteromultimer comprising the first and second hybrid proteins; 
said binding portions of each hybrid protein are termed "interacting polypeptide segments**). 



wo 95/26400 PCTAJS95/03918 

5 

The flrst hybrid protein comprises: (1) a first interacting polypeptide sequence 
in polypeptide linkage with (2) a DNA-binding domain of a transcriptional activator protein or 
other DNA binding protein (e.g., a repressor). The second hybrid protein comprises: (1) a 
second interacting polyp^tide sequence, enable of forming an intermolecular association with 
5 the first interacting polypqitide sequence under control conditions (e.g., physiological 
• conditions and absence of agent) in polypeptide linkage with (2) an activation domain of a 

transcriptional activator protein, whereby intermolecular binding between the first hybrid 
protein and the second hybrid protein (via the interacting polyp^tide sequences) thereby 
unit^ the DNA-binding domain of the first hybrid protein with the activation domain of the 

10 second generating a transcriptional activator function. Genially, the first hybrid protein and 
the second hybrid protein are encoded by polynucleotides which are consdtutively expressed in 
a host organism (e.g., a eukaryotic or prokaryotic cell, or multicellular organism). 

The relay gene (alternatively termed the signal inverter gene) is operably 
linked to a transcriptional regulatory sequ^ce (a "relay transcriptional regulatory sequrace") 

IS which is positively regulated by the transcriptional activator that is formed by the 

intermolecular binding of the first hybrid protein to the second hybrid protein. Hence, when 
the first hybrid protein binds to the second hybrid protein (via the interacting polypq)tide 
sequences), the transoiptional activator formed therd)y binds to a transcriptional r^ulatory 
sequence operably linked to the relay gene and enhances the net transcription of the relay 

20 gene. The relay gene encodes a protein that represses transcription of a rq)orter gene. Thus, 
when the first and second hybrid proteins are functionally bound to each other, the relay gene 
is expressed and thereby rq)resses transcription of the reporter gene(s). In an embodiment, 
such relay proteins are of the type often refim-ed to in the art as "negative regulators of 
transcription**. In an embodiment of the invention, the relay gene is a negative regulator of 

25 transcription in yeast; for example but not limitation the GAL80 gene can serve as a relay gene 
in yeast. In embodiments where host organisms are employed to harbor the reverse two- 
hybrid system, the relay gene is often a gene which naturally occurs in the germline DNA of 
the host organism species, and frequently can be an endogenous germline gene, or 
alternatively may be introduced into the host organism as exogenous DNA, typically into a 

30 host genome that lacks the corresponding functional endogenous gene (e.g., a "knockout 
background"). 

The reporter gene is operably linked to a transcriptional regulatory sequence 
("reporter transcriptional regulatory sequence") which is negatively regulated by the gene 
product of the relay gene and which is induced in the absence of the relay gene product. 
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Thus, transcription of the reporter gene is repressed in control conditions (e.g., physiological 
conditions in the absence of agent) wherein the two hybrid proteins bind to each other and 
form a transcriptional activator that increases transcription of the relay gene. Generally, the 
relay gene product eitiier binds to the transcriptional regulatory sequence operably linked to 

5 the reporter gene, or binds to a transcription protein that binds to the transcriptional regulatory 
sequence operably linked to the reporter gene. The net transcription rate of the reporter gene 
is reduced (or completely blocked) as a consequence of the relay gene produa binding to the 
reporter gene transcriptional regulatory sequence and/or to a transcription protein required for 
constitutive expression of the reporter gene. Any of a variety of r^rter genes that produce a 

10 positive readout can be used. For example and not limitation, suitable rq)orter genes are 
those which (1) confer a selectable phenotype to cells in whidi the reporter gene is efficientiy 
ej^ressed, and/or (2) encode a gene product (e.g., enzyme) which is convenientiy detected 
such as by ia silu assay or the like. Suitable genes which confer a selectable phenotype are 
exCTiplified by, but not limited to, genes which complement auxotrophic mutations in a host 

15 organism (e.g., yeast fflSJ), genes which encode drug resistance (e.g., necT), genes which 
induce cell proliferation, and other genes whose expression confers a selective growth 
advantage. Suitable genes which encode a gene product which is convenientiy detected jn s|U| 
are exwnplified by, but not limited to, ^-galactosidase (e.g.. ^ tocZ), luciferase, alkaline 
phosphatase, horseradish peroxidase, and the like. 

20 The invention provides polynucleotides encoding a first hybrid protein and a 

second hybrid protein. Sudi polynucleotides encode a DNA-binding domain or activation 
domain of a transcriptional activator and convenientiy can have a cloning site for adjacent 
insertion, in reading frame, of polynucleotide sequences encoding one or more interacting 
polypeptide sequence(s). Typically, a first polynucleotide will encode a first hybrid protein 

25 composed of a first predetermined interacting polypeptide sequence and a DNA-binding 
domain of a transcriptional activator; a second polynucleotide will encode a second hybrid 
protein composed of a second predetermined interacting polypq)tide sequence and an 
activation domain of a transcriptional activator, wherein the DNA-binding domain of the first 
hybrid protein can reconstitute with the activation domain and form a functional transcriptional 

30 activator. Often, the DNA-binding domain and the activation domain of the hybrid protein 
pair are derived ft'om the same naturally occurring transcription activator (e.g., Gal4). 
However, those of skill in the art can select DNA-binding domains and activation domains 
from distinct transcription activators which can reconstitute to form a functional transcriptional 
activator whidi does not occur in nature (e.g., a DNA-binding domain of the bacterial lexA 
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protein can be used in conjunction with a transcriptional activator from the viral protein, 
VP16; Vojtek et al. (1993) op.cit.) . Transcription and translation of such a polynucleotide 
produces a hybrid (or fusion) protein composed of an interacting polynucleotide segment and a 
DNA-binding domain or activation domain of a transcriptional activator. 
5 The invention also provides polynucleotides which comprise a transcriptional 

regulatory sequence operably linked to a relay (or signal inverter) gene. A relay (or signal 
inverter) gene encodes a protein that inhibits or otherwise rq)resses expression (typically 
transcription) of a predetermined reporter gene. Most usually, a relay protein is a negative 
regulator of transcription for a predetermined grae or gene subs^. In an embodiment, the 

10 relay protem is a transcription repressor protein that binds to a polynucleotide sequence and 
thereby inhibits transcription of a cis-linked and op^ably linked sequence. In an alternative 
embodiment, the relay protein binds to a protein that is a positive regulator of transcription of 
a predetermined gene or gene subset, and as a consequrace of bindmg thereby inhibits the 
transcriptional activity of the positive regulator. One variety of such a relay protein binds to 

15 and blocks the activation domain(s) of transcriptional activators. Although a variety of 
suitable relay proteins are apparent to those of skill m the art, this cat^ory of relay protein 
can be exemplified by the mammalian mdm2 oncoprotein which binds the transactivation 
domain of the tumor suppressor protein p5i, and the yeast GalSO protem which binds and 
inactivates the activation domain of Gal4. In an embodiment, the relay protein comprises a 

20 mutation, addition, or deletion that reduces the stability of the relay protein in vivo as 

compared to the naturally occurring cognate relay protein. Relay proteins can be referred to 
as signal inverter proteins, as they serve to invert a positive transcriptional signal (the 
reconstitution of a fanctional transcriptional activator by binding of the two hybrid proteins) 
into a negative transcriptional signal, which reduces transcription of a predetermined rq)orter 

25 gene. Generally, a polynucleotide encoding a relay protein is operably linked to a relay 
transcriptional regulatory sequence that produces transcription of the relay gene dependent 
upon functional reconstitution of the DNA-binding domain and activation domain of the two 
hybrid proteins. For example and not limitation, such a relay transcriptional regulatory 
sequence can comprise a promoter and a polynucleotide sequence comprising one or more 

30 site(s) which bind(s) a reconstituted functional transcriptional activator formed by association 
of the two hybrid proteins; for example, if the two hybrid transcriptional activator comprises a 
lexA DNA-bmding domain, the relay transcriptional regulatory sequence operably linked to the 
relay gene can comprise one or more lexA binding site sequences, arrayed in tandem. 

The invention also provides polynucleotides which comprise a transcriptional 
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regulatory sequence operably linked to a r^orter gene. The reporter gene encodes a protein 
that confers a selectable phenotype on a host cell and/or can be detected by an in vj^s assay, 
such as an la situ enzymatic assay (e.g., host cells expressing a ladZ reporter can be detected 
as blue staining cells in the presence of X-gaJ). A transcriptional regulatory sequence 
5 operably linked to die reporter gene comprises a promoter and generally produces constitutive 
transcription of the relay gene contingent upon the substantial absence of the relay protein. In 
an wnbodunent, the r^rter transcriptional regulatory sequence operably linked to the 
rq)orter gene comprises a binding site for a relay protein, wherein binding of die rel^ protein 
to the reporter transcriptional regulatory sequence inhibits constitutive transcription of the 
10 rqiorter gene. In an alternative embodiment, die reporter transcriptional regulatory sequence 
linked to the reporter gene comprises a binding site for a transcriptional activator protein, 
wherein binding of a constitutive transcriptional activator protem to the reporter transcriptional 
regulatory sequence produces constitutive transcription of die cis-linked rqwrter gene, and 
wherein die rday protein binds to or odierwise inactivates die transcriptional activator protein, 
15 diereby repressing constitutive egression of die r^rto- gene. 

The invention also provides host organisms (Q^pically unicellular organisms) 
which harbor a reverse two-hybrid system, Qrpically in die form of polynucleotides encoding a 
first hybrid protein, a second hybrid protein, a relay gene, and/or a r^rter gene, wh«ein 
said polynucleotide(s) are eidier stably replicated or introduced for transient expression. In an 
20 embodiment, die host organism is a yeast cell (e.g., Saccharomvces cervisiae^ m which die 
germline GAL80 gene is functionally inactivated, die relay gene encodes Gal80, and die 
reporter gene transcriptional regulatory sequ»ice comprises a Gal4-responsive promoter. 

The invention also provides a mediod for identifying agents diat inhibit 
binding of a first interacting polypq)tide to a second intwacting polypeptide. The mediod 
25 employs die reverse two-hybrid system described supra, wherein a first hybrid protein 
comprises die first interacting polypeptide and a second hybrid protein comprises a second 
interacting polypeptide. Heterodimerization (or higher order heteromultimerization) between 
the first hybrid protein and die second hybrid protem produces transcription of a relay gene 
encoding a protein which inhibits expression of a rqwrter protein. Host organisms harboring 
30 such a reverse two-hybrid system are cultured in die presence of an agent, such as a diffusible 
small molecule (typical MW < 5,000, preferably < 1,000) or a transfected cDNA expression 
polynucleotide encoding a polypeptide agent, and expression of die host organism reporter 
gene is determined and standardized to a parallel blank culmre which lacks an agent. Agents 
which produce a gnificant increase in expression of die reporter gene ui a host organism 
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after a suitable time period (e.g., usually at least 1 hour, often at least 3 hours, preferably 
about 6 hours, occaisionally overnight or longer) are thereby identified as inhibitors for 
blocking the intermolecular association between the first an second interacting polypeptide 
sequences. Such protem interaction inhibitors are candidate drugs for pharmaceutical use 
S and/or for use as commercial research reagents. In an embodiment of the invention, yeast 
cells are the host organism, the reporter gene encodes jS-galactosidase and/or a protein that 
complements an auxotrophic mutant yeast host cell, and the first and second interacting 
polypeptide sequences each comprise a bmding domain derived from a signal transduction 
protein. 

10 The invention also provides a kit comprising a reverse two-hybrid system, a 

host cell, and an instruction manual. Such kits may optionally include a panel of agents for 
testing. 



PETAILED DESCWnON 

15 Definitions 

Unless defined otherwise, all technical and scientific terms used herein have 
the same meaning as commonly undmtood by one of ordinary skill in the art to which this 
mvention belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, the prefierred 

20 methods and materials are described. For purposes of the present invention, the following 
terms are defined below. 

In the polypq)tide notation used herein, the leftiiand direction is the amino 
terminal direction and the righthand direction is the carboxy-terminal direction, in accordance 
with standard usage and convention. Similarly, unless specified otherwise, the lefthand end of 

25 single-stranded polynucleotide sequences is the 5' end; the lefthand direction of double- 
stranded polynucleotide sequences is referred to as the 5* direction. The direction of 5* to 3* 
addition of nascent RNA transcripts is referred to as the transcription direction; sequence 
regions on the DNA strand having the same sequence as the RNA and which are 5' to the 5* 
end of the RNA transcript are referred to as "upstream sequences'*; sequence regions on the 

30 DNA strand having the same sequence as the RNA and which are 3' to the 3' end of the RNA 
transcript are referred to as "downstream sequences". 

The term "naturally-occurring" as used herein as applied to an object refers to 
the fact that an objea can be found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses) that can be isolated from a source 
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in nature and which has not been intentionally modified by man in the laboratory is naturally- 
occurring. 

The term "agent" is used herein to denote a chemical compound, a mixture of 
chemical compounds^ a biological macromolecule, or an extract made from biological 
5 materials such as bacteria^ plants, fiingi, or animal (particularly manunalian) cells or tissues. 
Agents are evaluated for potential activity as specific protein interaction inhibitors (i.e., an 
agent which selectively inhibits a binding interaction between two predetermined polypq)tides 
but which does not substantially interfere with cell viability) by inclusion in screening assays 
described hereinbeiow. 

10 The term "protein interaction inhibitor" is used herein to refer to an agent 

which is identified by one or more screening method(s) of the invention as an agent which 
selectively inhibits protein-protein bindmg between a first interacting polypeptide and a second 
interacting polyp^tide. Some protein interaction inhibitors may have therapeutic potential as 
drugs for human use and/or may serve as commercial reagents for laboratory researdi or 

IS bioprocess control. Protein interaction inhibitors which are candidate drugs are then tested 
further for activi^ in assays which are routinely used to predict suitability for use as human 
and veterinary drugs, including in vivo administration to non-human animals and often 
including administration to human in ^proved clinical trials. 

As used h^in, the t^m "operably linked" refers to a linkage of 

20 polynucleotide elements in a functional relationship. A nucleic acid is "operably linked" when 
it is placed into a functional relationship with anoth^ nucleic acid sequence. For instance, a 
promoter or enhancer is operably linked to a coding sequence if it affects the transcription of 
the coding sequence. Operably linked means that the DNA sequences being linked are 
typically contiguous and, where necessary to join two protein coding regions, contiguous and 

25 in reading frame. However, since enhancers gienerally function when separated from the 
promoter by several kilobases and intronic sequences may be of variable lengths, some 
polynucleotide elements may be operably linked but not contiguous. 

As used herein, the term "endogenous DNA sequence" refers to naturally- 
30 occurring polynucleotide sequences contained in a eukaryotic or prokaryotic cell. Such 
sequences include, for example, chromosomal sequences (e.g., structural genes, promoters, 
enhancers, recombinatorial hotspots, repeat sequences, integrated proviral sequences). A 
"predetermined sequence" is a sequence which is selected at the discretion of the practitioner 
on the basis of known or predicted sequence information. An exogenous polynucleotide is a 
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polynucleotide which is transferred into a eukaryotic or prokaryotic cell. 

As used herein the term "physiological conditions" refers to temperature, pH, 
ionic strength, viscosity, and like biochemical parameters which are compatible with a viable 
organism, and/or which typically exist intracellularly in a viable cultured yeast cell or 
5 mammalian cell. For example, the intracellular conditions in a yeast cell grown under typical 
laboratory culture conditions are physiological conditions. Suitable in vitro reaction conditions 
for m vitro transcription cocktails are generally physiological conditions. In general, Iq vitro 
physiological conditions comprise 50-200 mM NaCl or KCl, pH 6.5-8,5, 20-45*C and 0.01- 
10 mM divalent cation (e.g., Mg*+, Ca*+); preferably about 150 mM NaQ or KQ, pH 7.2- 

10 7.6, 5 mM divalent cation, and often include 0.01-1.0 percent nonspecific protein (e.g., 
BSA). Particular aqueous conditions may be selected by die practitioner according to 
conventional m^ods. For general guidance, die following buffered aqueous conditions may 
be ^plicable: 10-250 mM NaQ, 5-50 mM Tris HQ, pH 5-8, with optional addition of 
divalent cation(s) and/or metal chelators and/or nonionic detergents and/or mrabrane 

15 fractions. 

The tenns "functional disruption" or "functionally disrupted" as used herein 
means that a gene locus conq)rises at least one mutation or structural alteration such that the 
functionally disrupted gme is substantially incapable of directing the efRcient expression of 
functional gene product. 

20 As used herein, the terms "interacting polypeptide segment" and "interacting 

polypeptide sequence" refer to a portion of a hybrid protein whidi can form a specific bindmg 
interaction with a portion of a second hybrid protein under suitable binding conditions. 
Generally, a portion of the first hybrid protein preferentially binds to a portion of the second 
hybrid protein forming a heteroduner or higher order heteromultimer comprising the first and 

25 second hybrid proteins; the binding portions of each hybrid protem are termed interacting 
polypeptide segments. 

Description of the Invention 

Generally, the nomenclature used hereafter and the laboratory procedures in 
30 cell culture, molecular genetics, and nucleic acid chemistry and cell culture described below 
are those well known and commonly employed in the art. Standard techniques are used for 
recombinant nucleic acid methods, polynucleotide synthesis, and microbial culture and 
transformation (e.g., electroporation, lipofeaion). Generally enzymatic reactions and 
purification steps are performed according to the manufacturer's specifications. The 
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techniques and procedures are generally performed according to conventional metiiods in the 
art and various general references (sgg, generally , Sambrook et al. Molecular Cloning: A 
Laboratory Manual, 2d ed. (1989) Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., which is incorporated herein by reference) which are provided throughout this 
5 document. The procedures therein are believed to be well known in the art and are provided 
for the convenience of the reader. All the information contained therein is incorporated herein 
by reference. 

The reverse two-hybrid method is generally ^plicabie for identifymg agents 
which inhibit binding between a varied of predetermined interactmg polypq>tides. 

10 

Overview 

A basis of the present invention is a strategy for screening a bank of ag^ts 
with a reverse two-hybrid systmi to identify agents which inhibit the intennolecular 
association of two interacting polypeptide sequences. Thus, in a reverse two-hybrid system 

15 there is at least one pair of interacting polypeptide sequences, witii a first inteacting 

polypeptide sequence present in one of the hybrid protein species and a second intoacting 
polypeptide sequence present in the other hybrid protein species. The choice of interacting 
polypeptide sequences incorporated in a reverse two-hybrid system is selected at the discrrtion 
of the practitioner. For example, a reverse two-hybrid system suitable for identifying agents 

20 which mhibit Fos/Jm leucine zipper formation may be composed of a first hybrid protein 
having an interacting polypeptide sequence comprising a Fos leucine zipper and a second 
hybrid protem having an interacting polypeptide sequence comprising a Jun leucine zipper. A 
variety of interacting protein sequences can be used; for example and not limitation, tiiese 
mclude: transcription factor bmdmg domains, multisubunit proteins, signal transduction 

25 protems (G protems, members of ras/raf/MEK signaling pathway(s), tumor suppressor protein 
bindmg domains (/», p5i), and the like), polypeptide ligands and their cognate receptor(s), 
active sites of enzymes which catalyze reactions mvolving binding to a polypeptide substrate 
and the polypeptide substrate itself, and essentially any pair of protein sequences which form 
an intermolecular association under physiological conditions. Generally, interacting 

30 polypeptides form heterodimers with a dissociation constant (Kd) of at least about 1 x 1{? M"*, 
usually at least 1 x 10* U'\ typically at least 1 x 10* U \ preferably at least 1 x 10* M'* to 1 
X 10' M"* or more, under suitable physiological conditions. 

Reverse two-hybrid systems are used for detecting die ability of agents to 
inhibit the intermolecular bmdmg of two interacting polypeptides and provide for fecile high- 
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throughput screening of agent banks (e.g., compound libraries, peptide expression libraries, 
and the like) to identify protein interaction inhibitors which preferentially inhibit 
intermolecular binding between two predetennined interacting polypeptide species. Such 
protem interaction inhibitors (specific binding antagonists) can modulate biochemical activity 
5 of the predetermined mteracdng specie(s) and thereby modulate biological function. Agents 
which alter the intermolecular association of the two interacting poIypq)tide sequences in the 
hybrid proteins, generally by inhibiting heterodimeric binding of the two hybrid proteins, 
score positively in the reverse two-hybrid system. The protein interaction inhibitors thereby 
identified are candidate drugs for human and veterinary therapaitic use and/or are suitable 

10 commercial reagents for laboratory research or bioprocess control. 

An agent enable of specifically inhibiting protein-protein binding of a 
ther^)^tically relevant protein interaction m xiYQ can be used for ther^y of disease or fior 
modulation of g^e expression in cells and organisms. Typically, an efficacious dose of a 
protein mteraction inhibitor is administered to a patient as a ther^)eutic or prophylactic for 

15 treating a pathological condition (e.g., cancer, inflammation, lymphoproliferative diseases, 
autoimmune disease, and tiie like). 

Description of the Pr efierred Embodiment 

In order to Ulustrate the invention, a description of a preferred embodiment is 

20 presented below. This embodiment comprises a reverse two-hybrid system in yeast cells that 
are functionally disrupted for endogenous GAL80 expression, wherein the intermolecular 
association of the first and second hybrid protems activates transcription of a GAL80 relay 
gene. Expression of GAL80 represses the transcriptional activity of constitutively expressed 
Gal4 protein and inhibits transcription of a (jal4^ependent reporter gene. 

25 A variety of alternative embodiments and variations will be apparent to those 

of skill in the art, including alternative relay genes, alternative host cells (e.g., mammalian, 
bacterial, fungal, insect, and the like), variations of the basic reverse two-hybrid method, and 
others. 

30 Two-Hvbrid svstems 

Transcriptional activators are proteins that positively regulate the expression of 
specific genes. They can be functionally dissected into two structural domains: one region 
that binds to specific DNA sequences and thereby confers specificity, and another region 
termed the activation domain that binds to protein components of the basal gene expression 
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machinery (Ma and Ptashne (1988) fieU 55: 443). These two domains need to be physically 
connected in order to function as a transcriptional activator. Two-hybrid systems exploit this 
finding by hooking up an isolated DNA bmduig domain to one protein (protein X), while 
hooking up the isolated activation domain to another protein Oprotein Y). When X and Y 
5 interact to a significant extent, the DNA binding and activation domains will now be 
connected and the transcriptional activator function reconstituted (Fields and Song (1989) 
Nature 245). The yeast host stram is engineered so that the reconstituted transcriptional 
activator drives the expression of a specific reporter gene such as HISS or lacZ^ which 
provides the read-cut for the protein-protein mteraction (Field and Song (1989) op.cit. : Chein 

10 et al. (1991) op.cit.) . One advantage of two-hybrid systems for monitoring protein-protein 
interactions is their sensitivity in detection of physically weak, but physiologically inqportant, 
protem-protein interactions. As such it offers a significant advantage ov^ other methods for 
detecting protein-protein mteractions (e.g., ELISA assay). Unlike the ELISA assay, 
however, the current two-hybrid system is not readily transplantable to drug screening 

15 operations. A major problem with the existing two-hybrid methods is that nonspecific 
inhibitors of transcriptional activation score the same as inhibitors of the specific protein- 
protein interaction. 

Negative Regulators of Transcription 
To address the aforementioned problem, the read-out of the conventional two- 

20 hybrid interaction can be reversed by interposition of a relay gene which serves to invert the 
ouq)ut produced from interaction of the two hybrid proteins from a positive transcriptional 
activator to a negative transcriptional regulator (e.g., repressor). In order to invert the 
readout from a positive transcription activator to a negative transcription rq)ressor, it is 
possible to take advantage of the properties of certain negative regulators of transcription. In 

25 an embodiment, some of these negative regulators block the function of specific transcriptional 
activators by binding to their activation domain. Two such examples are the mdm2 
oncoprotein which binds to and masks the trans-activation domain of the tumor suppressor 
protein p53 (Momand et al. (1993) Cell 69: 1237; Oliner et al. (1993) Nature 362 : 857), and 
the yeast Gal80 protein which binds and inactivates the transcriptional activator region of Gal4 

30 (Ma and Ptashne (1987) Ceil 5Q: 137; Johnston and Carlson (1993) Regulation of Carbon and 
Phpsphate Metabolism, vol. 2, Cold Spring Harbor Press, Cold Spring Harbor, New York). 
By designing the two-hybrid interaction to drive the expression of a negative regulator of a 
specific transcriptional activator, the resultant system is such that interference with the two- 
hybrid interaction results m increased activity of a transcriptional aaivator and hence a 
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positive signal. In view of the fact that the biology of the Gal80-Gal4 system is well 
understood in yeast, this pair of negative-reguiators/transcriptional activators is suitable for the 
reverse two^ybrid method. In prmciple, the pair of mdm2 and p53 proteins, or any other 
matched pair of transcriptional activator and specific negative regulator, will also work. 

S In the present embodiment, the two mteracting hybrid proteins reconstitute a 

transcriptional activator composed of a DNA binding domain derived from the bacterial 
protein encoded by lexA and an activator domain derived from the viral protein VP16 (Vojtek 
et al. (1993) op.cit.) . The reconstituted lexA/VP16 transcriptional activator binds to a relay 
gene operably linked to a transcriptional r^ulatory 

10 sequence containing tandem copies of a lexO binding site sequence which binds the lexA 
DNA-binding domain. Upon binding of the reconstituted lexA/VP16 transcriptional activator 
to the lexO binding site(s), the operably linked relay gene (GALSO) is efficiently expressed. 
Thus, when the two hybrid proteins are associated (e.g., as a heterodimer or the like),! the 
GALSO relay gene is expressed and serves to repress expression of a r^rt^ gene construct 

15 

Th^ qalgO-gftl4 gyytem 
The Gal80-Ga]4 system of regulatory proteins underlies die ability of yeast 
cells to respond to exogenously added galactose and specifically synthesize the enzymes 
needed to utilize it as a carbon/energy source (lohnston and Carlson (1993) op.cit. . Unless 

20 galactose is present, the GalSO protein binds and blocks the function of the transcriptional 
activator Gal4. In the absence of the GALSO gene, the transcriptional activator function of 
Gal4 is not masked and hence expression of galactose-regulated genes no longer requires 
galactose for induction. In the reverse two-hybrid systra, the host strain generally is 
functionally disrupted for the endogenous GALSO gene, but GalSO protein is provided through 

25 a two-hybrid driven relay gene construa (see. Experimental Example, infra) which is operably 
linked to a transcriptional regulatory sequence that binds a bacterial lexA DNA-binding 
domain present in the first hybrid protein. When the two-hybrid interaction is driving the 
expression of the relay gene product, GalSO, the GaJ4-induction of the reporter gene(s) is 
inhibited. When the two-hybrid interaction is blocked, the relay gene (GALSO) expression will 

30 be turned off and the Gal4-dependent transcriptional regulatory sequence operably linked to 
the reporter gene(s) is then able to drive expression of the reporter gene(s). 



TechniQues for Fine-Tuning Expression of the Relav Gene 
The sensitivity of this system can be modulated by adjusting the amount of 
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Gal4 or Gal80 protein. A host strain generally contains the wild-type GAL4 gene and hence 
contains very low levels of Gal4 when the yeast cells are cultured with carbon/energy sources 
such as raffinose (Johnston and Carlson (1993) op.cit,) . If necessary, the levd of Gal4 
protein can be decreased by at least five-fold by culturing the cells in glucose (Griggs and 
5 Johnston (1991) Proc. Natl. A cad. Sci, OJSA^ fig: 8597). Higher levels of Gal4 protein can 
be provided by transforming the strain with a multicopy plasmid encoding Gal4 (Schultz et al. 
(1987), 

The amount and/or stability of the relay protein, GalSO, can also be adjusted* 
Prefw^ly, the stability of the GaI80 protein is sufficient such that the addition of protein 
10 interaction inhibitor agents generates a detectable readout of the reporter gene(s) within about 
six hours, or most usually widiin the time-frame of an overnight assay. For this to be a 
convenient assay approach, Gal80. activity preferably deteriorates at a rapid rate when active 
inhibitor agents are added and the two-hybrid system is inhibited. The half-life of Gal80 
proteins in yeast cells has not been rigprously defined in the art. If Gal80 has a short half- 

15 life, it is generally only necessary to vary the level of transcription of GAL80 by changing 
either copy number of the two-hybrid relay gene construct or by varying the number of 
bindmg sites for the transcriptional activator (e.g., lexO operator sequences) in die 
transcriptional regulatory sequence of the relay gene construct. If Gal80 has an inordinately 
long half-life, it is preferable to engmeer a chimeric GaI80 protein with a shorter half-life. 

20 Successfiil engineering of long-lived proteins to proteins with shorter half-lives has been 
achieved by addition of PEST sequences to DHFR (Loetsdier et al. (1991) J. Biol. Chem. 
26^:11213) or by forming j3-galactosidase variants with different N-terminal residues by m 
vivQ processing of ubiquitin-/8-gaIactosidase fusions (Varshavsky et al. (1989) Yeast Gen^ic 
Engineering, Barr, Brake, and Valenzuela (eds.), Butterwortihs, pp. 109-143). TTie latter 

25 method has been well characterized in yeast, such that Gal80 variants with half-lives ranging 
from 2 minutes to over 24 hours can be readily generated. 

The following examples are offered by way of example and not by way of 

limitation. 

EXPERIMENTAL EXAMPLES 
30 Construction of the appropriate host veast strains 

Since the GAL80-GAL4 system is employed, the reporter genes in the yeast 
strain need to be operably linked to promoters that are responsive to Gal4. Reporter genes 
that have been operably linked to Gal4-responsive promoters were integrated into yeast strams 
(see Construction of Yeast Strains, infra). One of the reporter genes encodes jS-galactosidase, 
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whose expression allows quantitative transcriptional read-out, if desired. It is possible to 
utilize other reporter genes operably linked to Gal4-responsive promoters, such as ones 
encoding alkaline phosphatase, that would also allow easy quantitation of transcriptional read- 
out. JEY8, JEYIO, and JEY12, three independent progenitors for the reverse-two hybrid host 
S strains, were derived by standard genetic methods from a cross between YM2170 QAATa ura3 
his3 ade2 tys2 tyrl GAL4^ galSOA LEU2:GALl-lacZ; available from Dr. Mark Johnston, 
Washington University, St. Louis, MO) and YPB2 (MATa his3 adel leu2 ura3 tys2 trpl ami 
gal4A gaiaOA LYS2::GAL1'H1S3 URA3::GAU'lacZ) (Bartel et al. (1993) in Cellular 
Interactions in Development: A Practical Approach, Hartley DA (ed.) Oxford University 

10 Press, Oxford, UK, pp. 153). Hie progenitor strains (MATa his3 ade2 leu2 ura3 fys2 trpl 
GAL4* galSOA LYS2::GAL1'HIS3 URA3:.'GALl-lacZ) contain all the necessary rq)orter genes 
and have been tested for a functional Gal4 protein and rq)orter genes by analysis of galactose- 
induced expression of i3-galactosidase. To test m general wh^er re-mtroduction of Gal80 
protein negatively regulates Gal4 in die system, JEY8 was transformed with a highnxfy 

IS plasmid containing die wild-type GAL80 gene (pBM260; available from Mark Johdstoni,:. . 
Washington University, St. Louis, MO). Sufficient expression results in inhibition of the 
read-outs from the rq>orter genes {H1S3 and lacZ)^ which are determined by assaying rj3r 
galactosidase activity and growth in the absence of hisddine. Both of these reporter activities 
are scored in yeast grown on plates containing raffinose (which allows for foil activityiof 

20 GalSO protein) and galactose (which inactivates Gal80 protein). These tests confirm that in 
these strains the GalSO protein, expressed off its endogenous promoter, suppresses Ga]4 
foncdon. Twa4iybrid constructs are evaluated for their ability to drive sufficient GALSO 
expression from the LexO-GALSO fusion plasmids that are constructed (s^ infirdi. 

25 CppstruptioPi of ftg L^?i;0-GAi^W f\^m ?fim 

A chimeric gene (a LexO-GAL80 fusion) is constructed and serves as the relay 
(signal inverter) gene. The DNA-binding domain of the transcriptional activator that is used 
to drive expression of the relay gene is derived from the bacterial protein encoded by lexA and 
has been used before in two-hybrid systems as a fusion with the transcriptional activator from 

30 the viral protein VP16 (Vojtek et al. (1993) opxit. . Other transcriptional activators that have 
a defined DNA binding site, such as the ACEl gene product of S. cerevisiae (Munder and 
Furst (1992) Mol. Cell. Biol. 12: 2091) may be used. The LexO sites are generated by 
mutually primed synthesis fceg. Chapter 8.2A in Current Protocols in Molecular Biology 
(1990) Ausubel, Brent, Kingston, Moore, Seidman, Smith, and Struhl (eds.), Greene 



wo 95/26400 PCTAJS95/03918 

18 

Publishing Associates and Wiley Interscience, New York, NY) using the oligomer 
5'- 

GCGAATTCCTACrGTATATACATACAGTACCATCTACTGTATATACATACAGTAGC 
CGCTCGAGCGGC-3* [SEQ. ID NOilJ. The resultmg fragment contains four consensus 
S LexA binding sites in tandem. The ONA product is digested with EcoRI and inserted into the 
EcoRI site of pCZD (Lue et al. (1989) Proc. Nad. Acad. Sci, fUSA) M: 486) to generate 
pCZD-LexO. The pCZD vector contains a minimal TATA box for recognition of the basal 
transcriptional machinery but requires the addition of specific DNA sequences to effectively 
function as promote box. The Gal80 codmg sequence is isolated by PGR using ihe following 

10 two oligomers: S'-CGCGGATCCCGTTCnTCCAC TCCCG-3* [SEQ. ID N0:2]; and 

5*-CGGATCCGATGGAAGGATGCCCGCTGCTGC-3* [SEQ. ID N0:3]. The template is the 
plasmid pBM260 which contains the GAL80 gene subcloned in YEpI3 (available from Mark 
Johnston, Washington University, St. Louis, MO). The GAL80 PGR product is digested with 
BamHI and inserted mto the BamHI site of pCZD-LexO to create pLexO-GalSO. The LexO- 

15 Gal80 fusion is then subcloned mto pGalileo, a 2|t based yeast shuttle vector (20-30 copies per 
cell) c^ing the ADE2 selectable marto (available from Avtar Roopra, Washington 
University, St Louis, MO) to geno^ pJE42. From this plasmid, GEN- and integrating 
versions are constructed to provide a means of controlling the level of expression of GAL80 
by the two-hybrid mteraction. For example, the basal transcription from the 2fi plasmid may 

20 express sufficient Gal80 to require galactose for expression of the rq)orter gene(s) even in the 
absence of a lexA-based transcriptional activator. Additionally, LexO-ubiquitin-Gal80 fusions 
encoding a shortened half-life GalSO protein is constructed. 

In order to demonstrate that relay gene constructs comprising a LexO-GALSO 
polynucleotide fusion can be activated by the two-hybrid interaction to sufficient levels for 

25 regulating the Gal4-mediated reporter gene expression, a positive control is generated. Yeast 
are transformed with a plasmid that contains a fusion of the DNA binding domam (lexA) and 
transcriptional activation component (VP16) of the two-hybrid system and activates 
transcription of die relay gene LexO-GALSO fusion. The plasmid pLEX-VP16 (available from 
A. Vojtek; Vojtek et al. (1993) op.cit.^ is used for die positive control. The abUity of two- 

30 hybrid interactions to drive expression of the relay gene is demonstrated. 



Testing for thg ability of two-hvbrid interactions to drive expression of t he LexO-GAL8Q 
fusions 

The two-hybrid interaction that is used to test for its ability to drive sufficient 
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expression of the LexO-GALSO relay gene is the interaction of human H-ras p21 with human 
c-Raf (Van Aelst et al. (1993) Proc. Natl. Ac ad. Sci. OJSA^ §0: 6213; Vojtek et al. (1993) 
ofixiL). K-ras is linked by in-frame polynucleotide fusion to the VP16 activation domain, and 
Raf is linked by in-frame polynucleotide fusion to the DNA binding domain of the lexA gene 
5 product. pGBT8-Raf was constructed by ligating EcoRI and PstI linkers to a Raf coding 
sequence isolated by PCR amplification of a human placental cDNA library from Stratagene 
(San Diego, CA) as described by MacDonald ^ al. (1993) Mol. Cell. Biol. JLi: 6615. The 
iZafgene was cut out of pGBT8-Raf as an EcoRI to PstI fragment and subcloned into the 
EcoRI-PstI site of pBTMl 16 (that contains the LexA DNA bmding domain (Vojtek et al. 

10 (1993) i2iL£iL) to generate pBTM-Raf (pTESd). The EcoRI site mflintumg the same reading 
frame. pGBT8K-ras was constructed by PCR amplification of pEXV-K-ras (Hancock et al. 
(1990) £^ ^: 133) such that the K-ras sequence is isolated as a Sall-PstI restriction fragment 
which was then subcloned into Sall-Pstl-cut pGBT8. To construct pVPK-ras (pJE44), a PCR 
product of pGBT8K-ras was generated using the following oligomers as PCR amplimeis: 

15 5*-CGGGATCCATGACTGAATATAAACrTGTGGTAG-3' [SEQ. DO NO:4] 

5*-CGGGATCCTrACATAATTACACACTTTGTCrTTCACITG-3* [SEQ, ID NO:5] 
and the resultant PCR product was digested with BamHI and subcloned into the BamHI site of 
pVP16 (Vojtek et al. (1993) op.cit.^ to generate pVPK-ras (pJE44). The Lsx(X}AL80 relay 
gene plasmid, the pBTM-Raf and ±e pVP-K-ras (pJE44) plasmids are cotransfected into a 

20 host yeast strain and the ability of the two-hybrid interaction to drive sufficient e}q)ression of 
GAL80 to prevent the expression of the reporter genes (lacZ and HISS) is determined. 
Growth on galactose is used as an internal positive control to ensure that the promoter is still 
functional. 

25 Testing for the ability of a small molecule to interfere with a two-hvbrid interaction 

The reverse two-hybrid method is used as a screening assay for identifying 
small molecule inhibitors of protein-protein interaction, such that an exogenously added small 
molecule can interfere with a two-hybrid interaction. In one example, a reverse two-hybrid 
system utilizes the small molecule estradiol as the protein interaction inhibitor. Estradiol is a 

30 small lipophilic molecule that has been shown to be elective in yeast. It has been shown that 
estradiol reverses the interaction of the hormone binding domain of the estrogen receptor with 
the heat-shock protein HSP90. Thus, a first hybrid protein comprising the hormone bindmg 
domain of the estrogen receptor in polypeptide linkage to a lexA DNA-binding domain and a 
second hybrid protein comprismg the heat shock protein, HSP90. in polypeptide linkage to the 
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VP16 activation domain are constructed by standard methods. Polynucieotide(s) encoding the 
first and second hybrid proteins, a LexOGALSO relay gene construct, and a Gal4-dependent 
reporter gene construrt are introduced into the yeast host. Estrogen (e.g., estradiol) is 
evaluated as an agent for inhibiting formation of a functional two-hybrid heterodimer and 
5 thereby producing expression of the reporter gene. 

Testing for the abUitv of a polvoeptide to interfere with a two-h vbrid interaction 

The reverse two-hybrid method is used as a screening assay for identifying 
polyp^tide inhibitors of protein-protein intmiction, such that an intracellularly expressed 

10 polypq)tide can interfere with a two-hybrid interaction. In one embodiment, a reverse two- 
hybrid system utilizes a polypq)tide expressed from a cotransfected cDNA expression 
construct as the protein interaction inhibitor. 

A first hybrid protein comprising a first interacting polypeptide sequence in 
polypq)tide linkage to a lexA DNA-binding domain and a second hybrid protein comprising a 

IS second interacting polypeptide sequence in polypeptide linkage to the VP16 activiiion domain 
are constructed by standard methods. PolynucIeotide(5) encoding the first and second hybrid 
proteins, a LexO-GAL80 relay gene construct, and a Gal4-dq)radent reportet gene construct 
are introduced into the yeast host. A polynucleotide encoding and expressing a polypeptide 
typically between 5 and 500 amino acids long (e.g., a library member of a cDNA expression 

20 library) is also introduced into the yeast cells under conditions wherein the encoded 

polypq)tide is expressed intracellularly. The expressed polypeptide is evaluated as an agent 
for inhibiting formation of a functional two^iybrid heterodimer and thereby producing 
expression of the rq)orter gene. 

Essentially any of various expression clone libraries known in the art may be 

25 used, including conunercially available expression libraries (Clontech, Inc., Palo Alto, CA), 
Expression clone libraries may also be generated by the practitioner by conventional cloning 
methods and vectors known in the art (e.g., pcD, pSV), especially yeast expression vectors. 
Expression clone libraries comprise a collection of library members, each member comprising 
a cloned polynucleotide sequence (which may comprise mutation(s) or deletions), typically a 

30 cDNA sequence, operably linked to a promoter (and optionally an enhancer) which is 

transcriptionally active in the host cell so that the cloned sequence is transcribed and translated 
into a polypeptide. Genomic DNA sequences (e.g., complete structural genes or fragments 
thereoO may also serve as cloned sequences in expression libraries. Preferably, the cloned 
sequence is inserted m cloning site which facilitates the recovery of the cloned sequence free 
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from the promoter and other sequences which comprise an expression cassette. 

Expression clone library members are transferred into host cells by various 
means, including but not limited to: electroporation, lipofection, viral vector transduction, 
biolistics, and CaPO^ precipitation. Expression clone library members may be transferred 
S directly into host cells, or a relay and/or reporter polynucleotide and/or polynucleotide(s) 
encodusg the first and second hybrid proteins may be co-transfmed with expression clone 
library members into a host cdi, or a relay and/or reporter polynucleotide and/or 
polynucleotide(s) encoding the first and second hybrid proteins may be transferred into host 
cells subsequent to transfer of expression clone library members. 

10 Qoned polynucleotides can be recovered from expression clone library 

mmbers which are isolated by the scre^iing methods of the invention. Typically, cloned 
sequences are excised by restriction digestion with an enzyme(s) which cleave at the 
boundaries b^een the ends of the cloned sequence (e.g., cDNA) and the remainder of the 
expression clone library member. Alternatively, PCR preferably high-fideli^ PGR) or other 

IS amplification method (e.g., LCR) may be performed using pruners which flank the site at 
which the cloned sequence is mserted in the library member to amplify and thereby isolate the 
cloned sequence (U.S. Patent 4,683,202, incorporated herein by referrace). When PGR is 
used, it is generally preferable to incorporate known unique polynucleotide sequences flankmg 
at least one, and preferably both, side(s) of the site in which a cloned sequence is inserted to 

20 facUitate recovery of the selected cloned sequence(s). 

Although the present invention has been described in some detail by way of 
illustration for purposes of clarity of understanding, it will be apparent that certain changes 
and modifications may be practiced within the scope of the claims. 
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SEQUENCE USTING 
(!)■ GENERAL INFORMATION: 

5 

(i) APPUCANT: ERICKSON, JAMES R. 
POWERS. SCOTT 

(ii) TITLE OF INVENTION: REVERSE TWO-HYBRID METHOD 

10 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Cooley Godwaid Castro Huddleson & Tatum 
15 (B) STREET: Five Palo Alto Square, Fourth Floor 

(Q CITY: Palo Alto 

(D) STATE: California 

(E) COUNTRY: US 

(F) ZIP: 94306-2155 

20 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Roppy disk 

(B) COMPUTER: IBM PC conqmible 

(Q OPERATING SYSTEM: PC-DOS/MS-DOS 
25 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPUCATION DATA: 

(A) APPUCATION NUMBER: US 08/218,933 

(B) FILING DATE: 29-MAR-1994 
30 (C) CLASSIHCATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Torchia, Timothy E. 

(B) REGISTRATION NUMBER: 36,700 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 843-5481 

(B) TELEFAX: (415) 857-0663 

5 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 68 base pairs 

10 (B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 - . 

(ix) FEATURE: 

(A) NAME/KEY: niisc_feature 

(B) LOCATION: 1..68 

(D) OTHER INFORMATION: /standard_name= "PCR primer" 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GCGAATTCCT ACTGTATATA CATACAGTAC CATCTACTGT ATATACATAC AGTAGCCGCT 60 

30 

CQAGC66C 68 



35 
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(2) INFORMATION FOR SEQ ID NO:2: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
5 (B) TYPE: nucleic acid 

{Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(iu) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

15 i 
(ix) FEATURE: 

(A) NAME/KEY: inisc_featiire 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /standard_nanie= "PCR primer" 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
CGCGGATCCC GTTCTTTCCA CTCCCG 26 

25 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(ui) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

5 

(ix) FEATURE: 

(A) NAME/KEY: inisc_feature 

(B) LOCATION: 1..30 

(D) OTHER INFORMATION: /standard_name= "PCR primer" 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
CGGATCCGAT GGAAGGATGC CCGCTGCTGC 30 

15 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
20 (B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

30 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1..33 

(D) OTHER INFORMATION: /standard_nanie= "oligomer for PCR" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
CGGGATCCAT GACTGAATAT AAACTTGTGG TAG 33 

5 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
10 (B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..39 

(D) OTHER INFORMATION: /standard_naine= "oligomer for PCR" 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
CGGGATCCTT ACATAATTAC ACACTTTGTC TTTCACTTG 39 

30 
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1. A reverse two-hybrid system comprising: 

(1) a first hybrid protein conq)rising a first interacting polypeptide sequence in 
S polypeptide linkage to a DNA-binding domain of a transcriptional activator; 

(2) a second hybrid protein conq)rising a second interacting polypeptide 
sequence in polypeptide linkage to an activation domain of a transcriptional activator, whefein 
the second hybrid protein binds to the first hybrid protein via contact of the interacting 
polynucleotide sequences under physiological conditions; 

10 (3) a relay gene whose transcription is dependent upon the first hybrid protein 

and the second hybrid protein being bound to each other, therd)y reconstitutmg a 
transcriptional activator; and 

(4) a reporter gene whose transcrq)tion is rq)ressed by e:q>ression of the relay 
gene and which is substantially transcribed in the absence of relay gene expression. 

15 :5 

2. A reverse two-hybrid system of claim 1, wherein the two-hybrid 
system is in a yeast host cell. 

3. A reverse two-hybrid systm of claim 1, wherein the first hybrid 
20 protein, the second hybrid protein, a relay protein, and a reporter protein are encoded on at 

least one polynucleotide and said at least one polynucleotide is introduced into a yeast host 
cell. 

4. A reverse two-hybrid system of claim 1, wherein the first 
25 hybrid protein comprises a lexA DNA-binding domain in polypeptide linkage to the first 

interacting polypeptide sequence; 

the second hybrid protein con^jrises a VP16 activation domain in polypeptide 
linkage to the second interacting polypeptide sequence; 

the relay gene encodes Gal80 operably linked to a LexO sequence in a cis- 
30 linked relay gene transcription regulatory sequence; and 

the reporter gene comprises lacZ or HIS3 and is operably linked to a 
transcription regulatory sequence which confers Gal4-dependent transcription to cis-linked 
adjacent polynucleotide sequences. 



wo 95/26400 PCrAJS95/03918 

28 

5. A reverse two-hybrid system of claim 4 in a yeast cell produced by 
crossing a Saccharomvces organism having the genotype MATa his3 ade2 leu2 ura3 tys2 trpl 
GAL4^ galSOD LYS2::GAL1'HIS3 URA3::GALl-lacZ. 

6. A reverse two-hybrid system of claim 4 further comprising an agent. 

7. A reverse two-hybrid system of claim 4 further con5)rises an 
expression clone library member which expresses an intracellular polypeptide in the yeast 



10 

8. A yeast cell conq)rising a reverse two-hybrid system of claim 1 or 4. 

9. A polynucleotide encoding a GalSO polypeptide and conq>rising at least 
one operably linked LexO binding site. 

15 

10. A polynucleotide of claim 9 in a yeast cell which contains a 
functionally disrupted endogenous GALSO gene. 

11. A yeast cell containmg: 

20 a polynucleotide sequence encoding a first hybrid protein which is 

constitutively expressed; 

a polynucleotide sequence encoding a second hybrid protein whidi is 
constitutively expressed; 

a polynucleotide sequence encoding a relay protein whose expression is 
25 dependent upon mtermolecular binding of the first hybrid protein with the second hybrid 
protein under physiological conditions; and 

a polynucleotide sequence encoding a reporter protein whose expression is 
repressed by the relay protein. 

30 12. A yeast cell of claim 11, further comprising an expression clone 

library member which expresses a polypeptide encoded by a cDNA. 



13. A yeast cell of claim 11, further comprising an agent having a 
molecular weight of less than 1,000 daltons. 
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14. A method for identifying agents which inhibit intermolecular binding 
under physiological conditions between a first interacting polypeptide sequence and a second 
interacting polypeptide sequence, said method comprising the steps of: 

administering an agent to a host cell containing a reverse two-hybrid system of 
S claim 1 or 4 and incubating the host cell for a suitable period; 

determining whether the administration of the agent induces a substantially 
expression of the reporter gene; and 

identifying an agent which induces a substantially expression of the reporter 
gene as a protein interaction inhibitor. 

10 

15. A method of claim 14, wherein the agent is a molecule having a 
molecular weight less than about 1,000 daltons. 

16. A kit conq)rismg a reverse two-hybrid system, a host cell, an 
IS instruction manual, and optionally a panel of agents for testing. 



20 



17. A reverse two-hybrid system of claim 4, wherein the first interacting 
polypeptide sequence is a mammalian ras polyp^de aiKl the second interacting polypq>tide 
sequence is a Raf polypeptide. 
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